CDS

Accession Number TCMCG078C02348
gbkey CDS
Protein Id KAG0450661.1
Location complement(join(89327..89410,89502..89558,90005..90059,90167..90309,90585..90722,91794..91904,99460..99523,106588..106787,107309..107436,111655..111859,116788..116888,117136..117205,121864..121944,122933..123091))
Organism Vanilla planifolia
locus_tag HPP92_026819
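
The Location field lists the exon intervals of this CDS on the reverse strand. As a consistency check, the interval lengths can be summed and compared with the protein length reported in the record. The following is a minimal sketch, not part of the original record; the function name `exon_lengths` is hypothetical, and it assumes coordinates are 1-based and inclusive, as in GenBank feature locations.

```python
import re

# Location string copied from the record's CDS "Location" field.
LOCATION = (
    "complement(join(89327..89410,89502..89558,90005..90059,90167..90309,"
    "90585..90722,91794..91904,99460..99523,106588..106787,107309..107436,"
    "111655..111859,116788..116888,117136..117205,121864..121944,122933..123091))"
)

def exon_lengths(location: str) -> list[int]:
    """Return the length of each start..end interval (1-based, inclusive)."""
    pairs = re.findall(r"(\d+)\.\.(\d+)", location)
    return [int(end) - int(start) + 1 for start, end in pairs]

lengths = exon_lengths(LOCATION)
print(len(lengths), sum(lengths))  # 14 1596
```

The total of 1596 nt equals 3 x (531 + 1), which is consistent with a 531 aa protein plus one stop codon.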

Protein

Length 531aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000093.1
Definition hypothetical protein HPP92_026819 [Vanilla planifolia]
Locus_tag HPP92_026819

EGGNOG-MAPPER Annotation

COG_category S
Description NLE (NUC135) domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko03009
KEGG_ko ko:K14855
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGAATGCGGCCAGAGGAGTTGGAGAGATGAACGGCAATAGCAGCGTTCTGTGCTTGTTGACGAATTCAGAAGGGACGGCTCTTGGTGAGGCTCTCTACCTTCCAGAGAACGTCGGCCCGCCTCAGTTGCAAGAAATTGTCAACAAGCTCCTTCAAAACGAGGAAAGGCTACCTTATGCTTTTTATATATCTGATCAAGAGCTTGTTGTTCAACTGGGTGCTTTTTACAAAAGGACAAAGGTGGGTGCATCACGAGAATCTATCATACTCAATGTGATAACATTGGTTGATTCAAGTGGAACTTGGTCAATAAGGGCTTCACCTGATGTAACTTCAAATGGCATTAAACTCTTCCTATATCTTACTTGGAAGGAATTCACCCAAATCAAGGAAGAGGACATAAAAGATATTGAAGAAGTTAATCCAACTTCAAGCATTGATTACGAATTTGTAAAGCTCAAAAAGATTAATTACTTAACCTTCTTTGCGGGACATGATGAAGCCATGCTTTACATTGCTTTTAGCCCGATAGCGAGGAATCGGCCGATGGATCCGGTGATGCCACAAAGAAGATTGTGGGACCTTAACACTCAAACACCATTATTCACATGCTCAGGACATAAAAACTGGGTTTTATGTATAGCATGGTCTCCTGATGGAAAACATCTCGTGAGTGGTAGCAAGTCAGGAGAGCTTATATTATGGGATCCACAAACTGGGAAGCAGTTCGGCAATTCTCTTACTGGCCATAAAAAGTGGATTACTGCTATTTCTTGGGAGCCTGTACATTTGCAATCTCCTTGTCACCGATTTGTCAGCTCTAGCAAGGATGGGGATGCTCGAATTTGGGATATTTCTTTGAGGAGGTGCACTATTTCCCTTACTGGTCACACCCTAGCAGTGACGTGTGTAAAGTGGGGCGGTGATGGAATGATATACACAAGTTCTCAGGATTGTACAATAAAGGTTTGGGAGACTTCTCAAGGAAAGTTGGTCCGAGAACTAAAGGGACATGGTCACTGGGTTAATTCGCTTGCTCTGAGCACAGAGTATACCCTTCGGACTGGGGCCTTTGATCATACTGGAAAGACGTTTACATCTGCACATGAGATGAAGGAGGCAGCACTAGAAAGATACAACAAAATGAATGGTAATGGTCCCGAAAGGCTCGTATCAGGTTCTGATGATTTCACTATGTTTCTTTGGGAACCTGCTATTAATAAACACCCGAAGGCTCGGTTAACAGGCCATCAACAGCTTGTAAACCATGTTTACTTCTCTCCGGATGGTCAGTGGTTGGCCAGTGCTTCGTTTGACAAATCCGTTAAGTTGTGGAATGGCATTACAGGGCAATTCATTGCATCCTTCAGAGGGCATGTTGGACCAGTATACCAAATTAGCTGGTCAGCTGACAGTAGGCTTCTTCTGAGTGGAAGTAAAGACTCTACGCTGAAGGTGTGGGATATCAGAACTCACAAATTAAAGCAGGATCTTCCAGGTCATGCAGATGAGGTTTTCTCAGTTGATTGGAGTCCAGATGGGGAGAAGGTGGCATCTGGTGGAAAGGATAGGGTACTCAAGTTGTGGATGGGTTAG
Protein:  
MNAARGVGEMNGNSSVLCLLTNSEGTALGEALYLPENVGPPQLQEIVNKLLQNEERLPYAFYISDQELVVQLGAFYKRTKVGASRESIILNVITLVDSSGTWSIRASPDVTSNGIKLFLYLTWKEFTQIKEEDIKDIEEVNPTSSIDYEFVKLKKINYLTFFAGHDEAMLYIAFSPIARNRPMDPVMPQRRLWDLNTQTPLFTCSGHKNWVLCIAWSPDGKHLVSGSKSGELILWDPQTGKQFGNSLTGHKKWITAISWEPVHLQSPCHRFVSSSKDGDARIWDISLRRCTISLTGHTLAVTCVKWGGDGMIYTSSQDCTIKVWETSQGKLVRELKGHGHWVNSLALSTEYTLRTGAFDHTGKTFTSAHEMKEAALERYNKMNGNGPERLVSGSDDFTMFLWEPAINKHPKARLTGHQQLVNHVYFSPDGQWLASASFDKSVKLWNGITGQFIASFRGHVGPVYQISWSADSRLLLSGSKDSTLKVWDIRTHKLKQDLPGHADEVFSVDWSPDGEKVASGGKDRVLKLWMG
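
The CDS and Protein entries above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not part of the original record; it assumes the standard genetic code (NCBI translation table 1), and the helper name `translate` is hypothetical. Only the first ten codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS sequence listed in this record.
cds_prefix = "ATGAATGCGGCCAGAGGAGTTGGAGAGATG"
print(translate(cds_prefix))  # MNAARGVGEM
```

The output matches the first ten residues of the Protein sequence. Passing the full CDS string to the same function allows the complete comparison.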